-
Notifications
You must be signed in to change notification settings - Fork 28.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[SPARK-7411] [SQL] Support SerDe for HiveQl in CTAS #5963
Conversation
Merged build triggered. |
Merged build started. |
Test build #32072 has started for PR 5963 at commit |
Test build #32072 has finished for PR 5963 at commit
|
Merged build finished. Test FAILed. |
Test FAILed. |
Merged build triggered. |
Merged build started. |
Test build #32077 has started for PR 5963 at commit |
Test build #32077 has finished for PR 5963 at commit
|
Merged build finished. Test PASSed. |
Test PASSed. |
@chenghao-intel I have merged #5876. |
889d822
to
f4e243f
Compare
Merged build triggered. |
Merged build started. |
Test build #32302 has started for PR 5963 at commit |
Test build #32302 has finished for PR 5963 at commit
|
Merged build finished. Test PASSed. |
Test PASSed. |
// parquet.hive.DeprecatedParquetInputFormat => Parquet | ||
// TODO configurable? | ||
format.contains("Orc") || format.contains("Parquet") || format.contains("RCFile") | ||
}).getOrElse(false)) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm not sure this is really the logic we want. The goal here is to by default (i.e. if the user does not specify anything about storage), when convertCTAS is turned on to use the data sources API. Would it be possible to have the parser only fill in the storage options when the user specifies them and defer filling in default values until we are in the analyzer. That way we can distinguish "no storage options specified" from "default storage options chosen".
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ok, I see, I will move the default SerDe
from HiveQl
to Analyzer
This is looking pretty good. Thanks for taking the time to flesh this part out. |
Merged build triggered. |
Merged build started. |
Test build #32343 has started for PR 5963 at commit |
Merged build triggered. |
Merged build started. |
Test build #32344 has started for PR 5963 at commit |
Test build #32343 has finished for PR 5963 at commit
|
Merged build finished. Test PASSed. |
Test PASSed. |
Test build #32344 timed out for PR 5963 at commit |
Merged build finished. Test FAILed. |
Test FAILed. |
retest this please. |
Merged build triggered. |
Merged build started. |
Test build #32355 has started for PR 5963 at commit |
Test build #32355 has finished for PR 5963 at commit
|
Merged build finished. Test PASSed. |
Test PASSed. |
This is a follow up of #5876 and should be merged after #5876. Let's wait for unit testing result from Jenkins. Author: Cheng Hao <[email protected]> Closes #5963 from chenghao-intel/useIsolatedClient and squashes the following commits: f87ace6 [Cheng Hao] remove the TODO and add `resolved condition` for HiveTable a8260e8 [Cheng Hao] Update code as feedback f4e243f [Cheng Hao] remove the serde setting for SequenceFile d166afa [Cheng Hao] style issue d25a4aa [Cheng Hao] Add SerDe support for CTAS (cherry picked from commit e35d878) Signed-off-by: Michael Armbrust <[email protected]>
This is a follow up of apache#5876 and should be merged after apache#5876. Let's wait for unit testing result from Jenkins. Author: Cheng Hao <[email protected]> Closes apache#5963 from chenghao-intel/useIsolatedClient and squashes the following commits: f87ace6 [Cheng Hao] remove the TODO and add `resolved condition` for HiveTable a8260e8 [Cheng Hao] Update code as feedback f4e243f [Cheng Hao] remove the serde setting for SequenceFile d166afa [Cheng Hao] style issue d25a4aa [Cheng Hao] Add SerDe support for CTAS
This is a follow up of apache#5876 and should be merged after apache#5876. Let's wait for unit testing result from Jenkins. Author: Cheng Hao <[email protected]> Closes apache#5963 from chenghao-intel/useIsolatedClient and squashes the following commits: f87ace6 [Cheng Hao] remove the TODO and add `resolved condition` for HiveTable a8260e8 [Cheng Hao] Update code as feedback f4e243f [Cheng Hao] remove the serde setting for SequenceFile d166afa [Cheng Hao] style issue d25a4aa [Cheng Hao] Add SerDe support for CTAS
This is a follow up of apache#5876 and should be merged after apache#5876. Let's wait for unit testing result from Jenkins. Author: Cheng Hao <[email protected]> Closes apache#5963 from chenghao-intel/useIsolatedClient and squashes the following commits: f87ace6 [Cheng Hao] remove the TODO and add `resolved condition` for HiveTable a8260e8 [Cheng Hao] Update code as feedback f4e243f [Cheng Hao] remove the serde setting for SequenceFile d166afa [Cheng Hao] style issue d25a4aa [Cheng Hao] Add SerDe support for CTAS
This is a follow up of #5876 and should be merged after #5876.
Let's wait for unit testing result from Jenkins.